Maternal effects in molecular evolution

We introduce a model of molecular evolution in which the fitness of an individual depends both
on its own and on the parent's genotype. The model can be solved by means of a nonlinear mapping 
onto the standard quasispecies model. The dependency on the parental genotypes cancels from 
the mean fitness, but not from the individual sequence concentrations. For finite populations, the 
position of the error threshold is very sensitive to the influence from parent genotypes. In addition 
to biological applications, our model is important for understanding the dynamics of self-replicating 
computer programs. 

PACS numbers: 87.23.Kg 



Simple models of asexual evolution, such as the quasispecies model, typically assume that the
fitness of an organism is a function of only its genotype and the environment. This allows for
the analysis of evolution in static or variable environments, many-to-one mappings from
genotype to phenotype (neutrality), and phenotypic plasticity. These models disregard the
potential influence of the mother (or parent, in general) on the organism's phenotype. This
influence comes about because, in addition to the genetic material, a wealth of proteins and
other substances is transferred from mother to child. In the context of sexual reproduction,
the influence of the mother on a child's phenotype is usually called a maternal effect. A
classic example is that of the snail Partula suturalis [15], for which the directionality in
the coiling of the shells is determined by the genotype of the mother of an organism, rather
than by the organism's own genotype. Maternal effects are not exclusive to sexually
reproducing organisms, however; they can be observed in simple asexual organisms as well. In
bacteria, for example, the fitness of a strain in a given environment may depend on the
environment that was experienced by the strain's immediate ancestors.

Here, our objective is to define and study a model of the evolution of asexual organisms that
takes such maternal effects into account. We assume that the fitness of an organism is given
by the product of two quantities $a$ and $b$, where $a$ depends solely on the genotype of the
mother of the organism, and $b$ depends solely on the organism's own genotype. Since we need
to keep track of the abundances of all possible mother/child combinations, we need $n^2$
concentration variables if we distinguish $n$ different genotypes in our model. In the
following, we denote by $x_{ij}$ the concentration of organisms of genotype $j$ descended from
genotype $i$, and by $q_{ji}$ the probability that a genotype $i$ mutates into genotype $j$.
The time evolution of the $x_{ij}$ is then, in analogy to the quasispecies model,

$$\dot{x}_{ij}(t) = \sum_k a_k b_i q_{ji} x_{ki}(t) - f(t)\, x_{ij}(t) , \tag{1}$$

with $f(t) = \sum_{i,j} a_i b_j x_{ij}(t)$. The function $f(t)$ gives the average fitness of
the population at time $t$. In principle, this model can be solved by diagonalizing an
$n^2 \times n^2$ matrix. However, since even for relatively short genetic sequences the number
of possible genotypes $n$ is enormous, this direct method is very cumbersome. Fortunately, a
simple transformation exists that reduces the above problem to one in which the
diagonalization of an $n \times n$ matrix is sufficient. Namely, if we introduce variables
$x_i$ such that

$$x_{ij} = b_i q_{ji} x_i \Big/ \sum_k b_k x_k , \tag{2}$$

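then, after inserting Eq. (2) into Eq. (1), we obtain in the steady state
($\dot{x}_{ij} = 0$)

$$\tilde{f} x_i = \sum_j a_j b_j q_{ij} x_j \tag{3}$$

with $\tilde{f} = \sum_j a_j b_j x_j$. The inverse transformation, which converts Eq. (3) back
into the right-hand side of Eq. (1), can be achieved with

$$x_i = \sum_j a_j x_{ji} \Big/ \sum_{j,k} a_j x_{jk} . \tag{4}$$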

Therefore, Eq. (3) is fully equivalent to the steady state of Eq. (1). This leads to an
interesting conclusion. Note that Eq. (3) is simply the steady state equation of the
quasispecies model if we assume that genotypes $j$ replicate with replication rate
$c_j = a_j b_j$ and mutate into genotypes $i$ with $q_{ij}$, while $x_i$ gives the relative
concentration of genotype $i$. Consequently, the model with maternal effects is mathematically
equivalent to the standard quasispecies model. Moreover, with the aid of Eq. (2) and Eq. (3),
it can be shown that $f(t \to \infty) = \tilde{f}$. Therefore, the average fitness in both
models is the same; the maternal effects drop out of the expression for the average fitness.
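The equivalence can be checked numerically on a small example. The following is a minimal sketch, assuming NumPy and an arbitrary random landscape with $n = 4$ genotypes; the values of `a`, `b`, and `q` are illustrative and not taken from the paper. It solves the quasispecies eigenproblem of Eq. (3), maps the solution through Eq. (2), and evaluates the right-hand side of Eq. (1), the mean fitness, and the inverse transformation of Eq. (4).

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4                                   # number of genotypes (illustrative)

# Column-stochastic mutation matrix: q[j, i] = probability that i mutates into j.
q = rng.random((n, n)) + n * np.eye(n)
q /= q.sum(axis=0, keepdims=True)

a = rng.uniform(0.5, 2.0, n)            # maternal factors a_i (arbitrary values)
b = rng.uniform(0.5, 2.0, n)            # own-genotype factors b_i (arbitrary values)
c = a * b                               # effective replication rates c_i = a_i b_i

# Quasispecies steady state, Eq. (3): f_tilde * x_i = sum_j q_ij c_j x_j.
w = q * c                               # w[i, j] = q[i, j] * c[j]
eigvals, eigvecs = np.linalg.eig(w)
top = np.argmax(eigvals.real)           # dominant (Perron) eigenvalue
x = np.abs(eigvecs[:, top].real)
x /= x.sum()
f_tilde = c @ x

# Eq. (2): X[i, j] holds x_ij = b_i q_ji x_i / sum_k b_k x_k.
X = (b * x)[:, None] * q.T / (b @ x)

# Mean fitness of the maternal model: f = sum_ij a_i b_j x_ij.
f = a @ X @ b

# Right-hand side of Eq. (1): sum_k a_k b_i q_ji x_ki - f x_ij.
inflow = (a @ X) * b                    # inflow[i] = b_i * sum_k a_k x_ki
rhs = inflow[:, None] * q.T - f * X

# Eq. (4): x_i = sum_j a_j x_ji / sum_jk a_j x_jk.
x_back = (a @ X) / (a @ X).sum()

print("max |rhs of Eq. (1)|:", np.abs(rhs).max())
print("f - f_tilde:", f - f_tilde)
print("max |x_back - x|:", np.abs(x_back - x).max())
```

All three printed quantities should vanish up to floating-point error if the mapping holds, which is what the derivation above predicts.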

While the average fitness depends only on the values of $c_i$, the individual sequence
concentrations actually depend on the details of the maternal effects. In particular, the
total amount of sequences of a given genotype $i$ is not identical to the corresponding value
$x_i$ in the standard quasispecies model ($\sum_j x_{ji} \neq x_i$ in general). From Eq. (2)
we see that for every given mutation matrix $q_{ij}$, we can suppress any sequence
concentration to as small a level as we please, by reducing the corresponding $b_i$ and
holding the other $b_j$ constant ($\lim_{b_i \to 0} x_{ij} = 0$). Enhancement of sequence
concentrations is also possible, although there exists an upper bound that cannot be exceeded.
The upper bound is given by the matrix element $q_{ji}$
($\lim_{b_i \to \infty} x_{ij} = q_{ji}$). Its existence is easy to understand: by changing
$b_i$, we can only jointly manipulate the concentrations of all sequences descended from
genotype $i$. The ratio between different genotypes $j$ descended from $i$ is always fixed,
and it is determined by the matrix elements $q_{ji}$. At most, the descendants of genotype $i$
can make up the whole population, $\sum_j x_{ij} = 1$, which implies $x_{ij} = q_{ji}$.
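Both limits can be illustrated by evaluating Eq. (2) directly. The sketch below assumes NumPy; the function name `maternal_concentrations` and the three-genotype inputs `x` and `q` are hypothetical and chosen only for illustration. The quasispecies concentrations `x` are treated as given and held fixed, which corresponds to keeping every product $c_i = a_i b_i$ constant while $b_0$ is varied.

```python
import numpy as np

def maternal_concentrations(x, b, q):
    """Eq. (2): return X with X[i, j] = b_i q_ji x_i / sum_k b_k x_k."""
    x, b, q = map(np.asarray, (x, b, q))
    return (b * x)[:, None] * q.T / (b @ x)

# Hypothetical inputs: a fixed quasispecies distribution x and mutation matrix q.
x = np.array([0.5, 0.3, 0.2])
q = np.array([[0.90, 0.05, 0.05],
              [0.05, 0.90, 0.05],
              [0.05, 0.05, 0.90]])      # q[j, i]: genotype i mutates into j

for b0 in (1e-6, 1.0, 1e6):
    b = np.array([b0, 1.0, 1.0])
    X = maternal_concentrations(x, b, q)
    # Row 0 holds the concentrations x_0j of all descendants of genotype 0.
    print(f"b_0 = {b0:g}: x_0j = {np.round(X[0], 4)}")
```

For very small $b_0$ the row $x_{0j}$ should approach zero, and for very large $b_0$ it should approach the column $q_{j0}$ of the mutation matrix.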

We will now classify the different types of maternal effects. If all $a_i = 1$, such that
$b_i = c_i$, no maternal effects are present, and we obtain the normal sequence concentrations
from the quasispecies model. We will refer to this situation as the neutral case. In order to
classify all non-neutral situations, we compare concentrations of those sequences that are
true copies of their parents (this is the only reasonable way, given that all genotypes
descended from the same ancestor $i$ scale identically with $b_i$). If the concentration
$x_{ii}$ of a sequence with large $c_i$ is reduced, while the concentration $x_{jj}$ of some
other sequence with smaller $c_j$ is enhanced, we will speak of positive maternal effects.
Likewise, if the sequence concentration of a faster replicating sequence is enhanced, at the
expense of some slower replicating sequence, we will speak of negative maternal effects. In
short, positive maternal effects promote slower replicators, and negative maternal effects
promote faster replicators. We refer to the above classification as the direction of the
maternal effects. Likewise, the strength of the maternal effects indicates the degree to which
a system deviates from the neutral case (weak maternal effects show only a small deviation,
strong maternal effects show a large deviation from the neutral case).

With Eq. (2) we can solve the model, as long as there exists an analytical solution for the
corresponding quasispecies landscape. This means that solutions for multiplicative landscapes,
the single peak landscape, and certain spin-glass landscapes are readily available. In the
following, we discuss the well-known example of the sharp single-peak landscape [24]. We
assume that the genetic sequences are binary of length $l$. Moreover, we assume a uniform copy
fidelity $q$ per digit. The sequence 000... may replicate (in the absence of mutations) with
rate $c_0 = a_0 b_0$. We will refer to this sequence as the master sequence. Let all other
sequences replicate with $c_1 = a_1 b_1 \ll c_0$. If $l$ is large, we may neglect
back-mutations onto the master sequence, in which case it is sufficient to keep track of the
total concentration of all sequences off the peak in a single variable, $x_1$. The mutation
matrix $q_{ij}$ is then a $2 \times 2$ matrix with the elements $q_{00} = q^l$,
$q_{10} = 1 - q^l$, $q_{01} = q_{10}/(2^l - 1)$, and $q_{11} = 1 - q_{01}$ (the elements
$q_{01}$ and $q_{11}$ are approximated). In the standard quasispecies model, the equilibrium
concentration of the master sequence $x_0$ is given by



$$x_0 = (c_0 q_{00} - c_1)/(c_0 - c_1) \tag{5}$$

and $x_1$ likewise as $x_1 = 1 - x_0$. The average fitness follows as

$$\tilde{f} = \begin{cases} c_0 q_{00} & \text{for } x_0 > 0, \\ c_1 & \text{else.} \end{cases} \tag{6}$$



Now, for the sequence concentrations with maternal effects, we obtain from Eq. (2) in
conjunction with Eq. (5)

$$x_{0i} = b_0 q_{i0} (c_0 q_{00} - c_1)/A , \tag{7}$$
$$x_{1i} = b_1 q_{i1} (c_0 - c_0 q_{00})/A , \tag{8}$$

with $A = (b_0 - b_1) c_0 q_{00} + b_1 c_0 - b_0 c_1$ and $i = 0, 1$.
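As a worked example, Eqs. (5), (7), and (8) can be evaluated for the parameter sets quoted in the caption of Fig. 1. This is a minimal sketch in plain Python; the function name `single_peak_concentrations` and its argument names are illustrative choices, and the formulas are only meaningful below the error threshold, where $c_0 q_{00} > c_1$.

```python
def single_peak_concentrations(q, length, c0, c1, b0, b1):
    """Evaluate Eqs. (5), (7), (8) for the sharp single-peak landscape.

    Returns (x0, X), where X[i][j] is x_ij. Valid only while c0 * q00 > c1,
    i.e. below the error threshold of the infinite population.
    """
    q00 = q ** length                      # master copied without error
    q10 = 1.0 - q00                        # master mutates off the peak
    q01 = q10 / (2 ** length - 1)          # approximate back-mutation rate
    q11 = 1.0 - q01
    x0 = (c0 * q00 - c1) / (c0 - c1)       # Eq. (5)
    A = (b0 - b1) * c0 * q00 + b1 * c0 - b0 * c1
    X = [[b0 * q00 * (c0 * q00 - c1) / A,  # x_00, Eq. (7) with i = 0
          b0 * q10 * (c0 * q00 - c1) / A], # x_01, Eq. (7) with i = 1
         [b1 * q01 * (c0 - c0 * q00) / A,  # x_10, Eq. (8) with i = 0
          b1 * q11 * (c0 - c0 * q00) / A]] # x_11, Eq. (8) with i = 1
    return x0, X

# Parameter sets from the caption of Fig. 1, at an error rate 1 - q = 0.04.
cases = [("positive", 0.1, 1.0), ("neutral", 1.0, 1.0), ("negative", 1.0, 0.1)]
for label, b0, b1 in cases:
    x0, X = single_peak_concentrations(q=0.96, length=20, c0=10.0, c1=1.0, b0=b0, b1=b1)
    print(f"{label:8s} x_00={X[0][0]:.3f} x_01={X[0][1]:.3f} "
          f"x_10={X[1][0]:.3g} x_11={X[1][1]:.3f}")
```

In the neutral case the first row sums to the quasispecies value $x_0$ of Eq. (5), and in every case the four concentrations sum to one.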

Figure 1 displays the four sequence concentrations $x_{00}$, $x_{01}$, $x_{10}$, $x_{11}$ of
the above defined landscape, for positive, negative, and neutral maternal effects. We see that
the maternal effects indeed result in a significant shift in the sequence concentrations. The
concentration $x_{11}$ (shown in the lower right of Fig. 1), for example, encompasses almost
the complete population for positive maternal effects at an error rate of about 0.04, while it
constitutes less than 20% in the case of negative maternal effects at the same error rate.

The potential shift in the individual sequence concentrations has important implications for
finite populations. When the concentration of a sequence (as predicted for an infinite
population) approaches the inverse of the population size, that sequence will most certainly
be lost through sampling fluctuations. In the case of the master sequence, this effect is
responsible for the shift of the error catastrophe towards smaller error rates for finite
populations in the ordinary quasispecies model [25-28]. Now, since the concentration of the
master sequence can be made arbitrarily small with suitable maternal effects, it follows that
the error threshold can be shifted. This effect is illustrated in Fig. 2 for a population size
of $N = 1000$, for which the error transition in the normal quasispecies model (as represented
by the 'neutral' case) is already significantly shifted. Positive maternal effects increase
this shift by a fair amount, while negative maternal effects can almost completely counteract
the finite population effect, and move the error transition very close to the infinite
population limit. Besides the shift in the error transition, Fig. 2 shows that the average
fitness is indeed unaffected by strength and/or direction of the maternal effects, as all
three curves lie exactly on the infinite population result for error rates below the
respective error transitions.
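A finite-population experiment of this kind can be sketched as follows. The text does not specify the simulation algorithm, so everything here is an assumption made for illustration: discrete non-overlapping generations, fitness-proportional sampling of parents, independent bit flips with probability $1 - q$, and the hypothetical function name `simulate`. The maternal factors are set to $a_i = c_i / b_i$ so that $a_i b_i = c_i$.

```python
import numpy as np

def simulate(N, length, q, c0, c1, b0, b1, generations, seed=0):
    """Assumed Wright-Fisher-style sketch of a finite population with maternal effects.

    Every individual carries its own bit string and remembers whether its
    mother was the master sequence (type 0) or an off-peak sequence (type 1).
    Returns the mean fitness of the population in each generation.
    """
    rng = np.random.default_rng(seed)
    a = np.array([c0 / b0, c1 / b1])            # a_i chosen so that a_i * b_i = c_i
    b = np.array([b0, b1])
    seqs = np.zeros((N, length), dtype=np.uint8)    # start on the master sequence
    mother_type = np.zeros(N, dtype=int)            # initial mothers: master
    mean_fitness = []
    for _ in range(generations):
        own_type = seqs.any(axis=1).astype(int)     # 0 = master, 1 = off-peak
        fitness = a[mother_type] * b[own_type]      # fitness = a(mother) * b(own)
        mean_fitness.append(fitness.mean())
        # Fitness-proportional choice of one parent per offspring.
        parents = rng.choice(N, size=N, p=fitness / fitness.sum())
        # Each bit is miscopied independently with probability 1 - q.
        flips = rng.random((N, length)) > q
        seqs = seqs[parents] ^ flips.astype(np.uint8)
        mother_type = own_type[parents]
    return np.array(mean_fitness)

# Example run with the 'negative' parameters of Fig. 1 at 1 - q = 0.04.
trace = simulate(N=1000, length=20, q=0.96, c0=10.0, c1=1.0,
                 b0=1.0, b1=0.1, generations=200)
print("mean fitness over the last 50 generations:", trace[-50:].mean())
```

Tracking only the mother's type rather than her full genotype is sufficient here because the single-peak landscape distinguishes just two fitness classes.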

We have seen above that the mean fitness in the population is not influenced by the existence
of maternal effects. Since selection acts only on the average fitness, it follows that the
maternal effects cannot be under selective pressure. In order to verify this, we have
performed simulations in which strength and direction of the maternal effects were allowed to
evolve. To each sequence in the population, we added an inheritable variable $z$. On
reproduction, the offspring received a value $z' = z + dz$, where $dz$ was a normally
distributed random variable. For the master sequence, $z$ was then mapped into $a_0$ and $b_0$
via $a_0 = (\alpha + z)/\alpha$ for $z > 0$, $a_0 = \alpha/(\alpha + z)$ for $z < 0$, and
$b_0 = 1/a_0$, with $\alpha$ defining the scale between $z$ and $a_0$, $b_0$. For $a_1$ and
$b_1$, the value of $z$ was ignored. Figure 3 shows a typical simulation run in such a system.
We chose $N = 1000$ and $1 - q = 0.06$, so that the population was below the error threshold
in the absence of maternal effects, and we initialized all sequences in the population to
$z = 0$. Over the course of a simulation, the $z$ values drift randomly, which can be seen in
increasing and diminishing fluctuations about the average fitness. When the average $z$ drifts
below zero, the fluctuations decrease, because $z < 0$ corresponds to negative maternal
effects, which shift the population away from the error threshold. When the average $z$ drifts
above zero, on the other hand, the fluctuations increase. If there is no upper limit to $z$,
the fluctuations will eventually grow so large that the population is pushed over the error
threshold. In Fig. 3, this happened around generation 5400.

The model we have introduced in this paper oversimplifies the situation for bacteria, where
substances can remain in the cytoplasm for several generations, such that not only the parent,
but also the grandparent, the great-grandparent, etc. have an influence on the phenotype of an
individual. However, it is an exact description of the dynamics of the digital organisms
(self-replicating and evolving computer programs) of the Avida system, which has been used
extensively in experimental evolution research. The replication rate of these digital
organisms is the ratio between the number of instructions per unit time that they can execute
[the speed of their central processing unit (CPU)] and the number of instructions they have to
execute in order to produce a new offspring (length of the gestation cycle). The CPU speed
depends on the number and type of logical operations that these organisms perform in addition
to their replicatory activity (the more logical operations an organism performs, the faster
its CPU will operate). Since the logical operations an organism can perform are only known a
posteriori, these organisms obtain their initial CPU speed from their parent organism. The CPU
speed thus corresponds to the parameter $a$ of the present work, and the length of the
gestation cycle to the inverse of the parameter $b$. Therefore, we have shown that a
quasispecies description of the digital organisms is indeed justified, as was proposed
previously. Also, our model might lead to a detailed quantitative description of the dynamics
of digital organisms in future work.

FIG. 1. Relative sequence concentrations vs. error rate $1 - q$. From left to right, and from
top to bottom, we display $x_{00}$, $x_{10}$, $x_{01}$, $x_{11}$. Solid lines are the
analytical predictions of Eqs. (7), (8); dashed lines stem from simulations with $N = 10000$
sequences of length $l = 20$. The parameters of the fitness landscapes were $c_0 = 10$ and
$c_1 = 1$, with $b_0 = 0.1$, $b_1 = 1$ (positive); $b_0 = 1$, $b_1 = 1$ (neutral); $b_0 = 1$,
$b_1 = 0.1$ (negative).

FIG. 2. Average fitness vs. error rate $1 - q$. The solid line represents Eq. (6), and the
dashed lines stem from simulations with $N = 1000$ sequences of length $l = 20$. The fitness
landscapes were identical to those of Fig. 1.

FIG. 3. Average fitness vs. time in a population with evolving maternal effects. The dashed
line indicates the infinite population result [Eq. (6)]. The population consisted of
$N = 1000$ sequences of length $l = 20$, the error rate was $1 - q = 0.06$, and the landscape
was defined by $c_0 = 10$, $c_1 = 1$. The scale parameter $\alpha$ was set to $\alpha = 10$.